Picture for Jinghan Li

Jinghan Li

LamPO: A Lambda Style Policy Optimization for Reasoning Language Models

Add code
May 20, 2026
Viaarxiv icon

LambdaPO: A Lambda Style Policy Optimization for Reasoning Language Models

Add code
May 19, 2026
Viaarxiv icon

Think, then Score: Decoupled Reasoning and Scoring for Video Reward Modeling

Add code
May 07, 2026
Viaarxiv icon

Beyond Where to Look: Trajectory-Guided Reinforcement Learning for Multimodal RLVR

Add code
Mar 27, 2026
Viaarxiv icon

Bridging Perception and Reasoning: Token Reweighting for RLVR in Multimodal LLMs

Add code
Mar 26, 2026
Viaarxiv icon

Enhancing Multi-Modal LLMs Reasoning via Difficulty-Aware Group Normalization

Add code
Feb 26, 2026
Viaarxiv icon

Learning from Next-Frame Prediction: Autoregressive Video Modeling Encodes Effective Representations

Add code
Dec 24, 2025
Figure 1 for Learning from Next-Frame Prediction: Autoregressive Video Modeling Encodes Effective Representations
Figure 2 for Learning from Next-Frame Prediction: Autoregressive Video Modeling Encodes Effective Representations
Figure 3 for Learning from Next-Frame Prediction: Autoregressive Video Modeling Encodes Effective Representations
Figure 4 for Learning from Next-Frame Prediction: Autoregressive Video Modeling Encodes Effective Representations
Viaarxiv icon

AdaViP: Aligning Multi-modal LLMs via Adaptive Vision-enhanced Preference Optimization

Add code
Apr 22, 2025
Viaarxiv icon

DAMO: Data- and Model-aware Alignment of Multi-modal LLMs

Add code
Feb 04, 2025
Figure 1 for DAMO: Data- and Model-aware Alignment of Multi-modal LLMs
Figure 2 for DAMO: Data- and Model-aware Alignment of Multi-modal LLMs
Figure 3 for DAMO: Data- and Model-aware Alignment of Multi-modal LLMs
Figure 4 for DAMO: Data- and Model-aware Alignment of Multi-modal LLMs
Viaarxiv icon

DiffGAD: A Diffusion-based Unsupervised Graph Anomaly Detector

Add code
Oct 09, 2024
Figure 1 for DiffGAD: A Diffusion-based Unsupervised Graph Anomaly Detector
Figure 2 for DiffGAD: A Diffusion-based Unsupervised Graph Anomaly Detector
Figure 3 for DiffGAD: A Diffusion-based Unsupervised Graph Anomaly Detector
Figure 4 for DiffGAD: A Diffusion-based Unsupervised Graph Anomaly Detector
Viaarxiv icon